General c-Means Clustering Model and Its Application
نویسنده
چکیده
Considered many partitional clustering algorithms are originated from the definition of mean, we propose a new clustering model---general c-means clustering algorithm (GCM). Generally, when the data set is clustered into c (c>1) subsets, each subset is often expected to have a different prototype (or cluster center) than others. Therefore, we propose the definition of undesirable solution of clustering algorithms. As the GCM has undesirable solution under a mild condition, undesirable solution of the GCM is not expected to be stable. According to these assumptions, we obtain the necessary conditions for the GCM as a good clustering model. Fortunately, such conditions have offered a theoretical basis for selection of the parameters in many clustering algorithms, which is an open problem for such algorithms, for example, we get the theoretical rule for selection of the weighting exponent in the FCM, and explain why the weighting exponent should be greater than 1, etc. Moreover, we discover the relation between the GCM model and Occam’s razor, which offers the deep reason behind many famous partitional clustering algorithms. Based on these results, we can study many objective function based clustering algorithms.
منابع مشابه
A Fuzzy C-means Algorithm for Clustering Fuzzy Data and Its Application in Clustering Incomplete Data
The fuzzy c-means clustering algorithm is a useful tool for clustering; but it is convenient only for crisp complete data. In this article, an enhancement of the algorithm is proposed which is suitable for clustering trapezoidal fuzzy data. A linear ranking function is used to define a distance for trapezoidal fuzzy data. Then, as an application, a method based on the proposed algorithm is pres...
متن کاملApplication of Pattern Recognition Algorithms for Clustering Power System to Voltage Control Areas and Comparison of Their Results
Finding the collapse susceptible portion of a power system is one of the purposes of voltage stability analysis. This part which is a voltage control area is called the voltage weak area. Determining the weak area and adjecent voltage control areas has special importance in the improvement of voltage stability. Designing an on-line corrective control requires the voltage weak area to be determi...
متن کاملApplication of Pattern Recognition Algorithms for Clustering Power System to Voltage Control Areas and Comparison of Their Results
Finding the collapse susceptible portion of a power system is one of the purposes of voltage stability analysis. This part which is a voltage control area is called the voltage weak area. Determining the weak area and adjecent voltage control areas has special importance in the improvement of voltage stability. Designing an on-line corrective control requires the voltage weak area to be determi...
متن کاملFuzzy Clustering Approach Using Data Fusion Theory and its Application To Automatic Isolated Word Recognition
In this paper, utilization of clustering algorithms for data fusion in decision level is proposed. The results of automatic isolated word recognition, which are derived from speech spectrograph and Linear Predictive Coding (LPC) analysis, are combined with each other by using fuzzy clustering algorithms, especially fuzzy k-means and fuzzy vector quantization. Experimental results show that the...
متن کاملOPTIMIZATION OF FUZZY CLUSTERING CRITERIA BY A HYBRID PSO AND FUZZY C-MEANS CLUSTERING ALGORITHM
This paper presents an efficient hybrid method, namely fuzzy particleswarm optimization (FPSO) and fuzzy c-means (FCM) algorithms, to solve the fuzzyclustering problem, especially for large sizes. When the problem becomes large, theFCM algorithm may result in uneven distribution of data, making it difficult to findan optimal solution in reasonable amount of time. The PSO algorithm does find ago...
متن کاملADAPTIVE NEURO FUZZY INFERENCE SYSTEM BASED ON FUZZY C–MEANS CLUSTERING ALGORITHM, A TECHNIQUE FOR ESTIMATION OF TBM PENETRATION RATE
The tunnel boring machine (TBM) penetration rate estimation is one of the crucial and complex tasks encountered frequently to excavate the mechanical tunnels. Estimating the machine penetration rate may reduce the risks related to high capital costs typical for excavation operation. Thus establishing a relationship between rock properties and TBM pe...
متن کامل